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Abstract. We investigate the degree distribution resulting from graph genera- 
tion models based on rank-based attachment. In rank-based attachment, all ver- 
tices are ranked according to a ranking scheme. The link probability of a given 
vertex is proportional to its rank raised to the power —a, for some a G (0, 1). 
Through a rigorous analysis, we show that rank-based attachment models lead 
to graphs with a power law degree distribution with exponent 1 + 1/ a whenever 
vertices are ranked according to their degree, their age, or a randomly chosen fit- 
ness value. We also investigate the case where the ranking is based on the initial 
rank of each vertex; the rank of existing vertices only changes to accommodate 
the new vertex. Here, we obtain a sharp threshold for power law behaviour. 
Only if initial ranks are biased towards lower ranks, or chosen uniformly at ran- 
dom, we obtain a power law degree distribution with exponent 1 + 1/a. This 
indicates that the power law degree distribution often observed in nature can be 
explained by a rank-based attachment scheme, based on a ranking scheme that 
can be derived from a number of different factors; the exponent of the power 
law can be seen as a measure of the strength of the attachment. 



1. Introduction 

The occurrence of power law degree distributions in self-organizing networks 
such as the web graph is often explained by a model based on the principle of 
Preferential Attachment (PA). In the original PA model proposed by Barabasi 
and Albert [2], new vertices join a graph one by one, and each new vertex chooses 
a pre-determined number of neighbours at random, so that the probability that a 
vertex is chosen as a neighbour (its link probability) is proportional to its degree. 
Analysis shows that this model indeed generates power law graphs with high 
probability, where the exponent of the power law equals 3 [2J [5] . More general 
PA models, such as the ones proposed and analyzed in |I] and [6] allow for the 
creation of edges between existing vertices and the deletion of edges and vertices. 
The power law of the degree distribution in this case depends on the probabilities 
with which various kind of steps (edge addition, vertex addition, deletion) are 
taken. By varying these probabilities, any exponent in the range (2, oo) can be 
obtained. In [3], the preferential attachment with fitness is studied, in which 
the degree of a vertex is scaled by its individual fitness factor to determine its 
attractiveness. 
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It is tempting to conjecture that the exponent of the power law can be controlled 
by varying the strength of preferential attachment. Precisely, the link probability 
could be proportional to the degree raised to a power a; the hope is that the 
exponent of the power law would be a function of a. This would give a more 
natural way to fit the model to real-life data than that given by the generalized 
addition/deletion models: the ratio of edge vs. vertex addition steps may well 
be dictated by considerations about the data that are independent of the power 
law. Unfortunately, as pointed out in [IT] , this approach does not work. Only 
the case where a = 1 (i.e. the standard PA model) leads to a power law degree 
distribution. If a < 1 (weak preferential attachment), the degree distribution is a 
stretched exponential, while if a > 1 (strong preferential attachment), the graph 
will be close to a star, with one vertex adjacent to almost all other vertices. 

In this paper, we show that the approach outlined above does give the proposed 
results if the preferential attachment is based on a ranking of the vertices. In other 
words, the vertices are ranked from 1 to n according to their degree (so the vertex 
with highest degree has rank 1, etc.), and the link probability of a given vertex is 
proportional to its rank, raised to the power —a for some a G (0, 1); we will refer 
to a as the attachment strength. (Negative powers are chosen since a low value 
for rank should result in a higher link probability.) Then, with high probability, 
the resulting graph will have a power law with exponent 1 + 1/a. The rank-based 
approach was first proposed by Fortunato, Flammini and Menczer in [7J, and the 
occurrence of a power law was postulated based on simulations. 

In [TJ, the attachment strength a is allowed to be any positive real number. 
However, if a > 1, then only a vanishingly small proportion of all vertices have 
any acquired links at all. This is easy to show for the scenario where vertices 
are ranked according to age. We feel confident that the same holds for the other 
ranking schemes that lead to power law degree distributions. Since a scenario 
where almost all vertices have the minimum degree does not correspond to a 
typical self-organizing network, we have limited our analysis to the case where 
a G (0,1). 

As we will show, rank-based attachment leads to power law graphs for a variety 
of different ranking schemes. One obvious ranking scheme is to rank vertices by 
age (the old get richer); we show that this leads to a power law with the same 
exponent 1 + 1/a. A more general graph model with rank-based attachment based 
on ranking according to age is the protean graph model, which was proposed and 
explored by Luczak, Pralat and Wormald in [T21 dSJ [33]. It is also discussed 
in [7J and [8]. As a contrast, in this paper we also consider an inverse age ranking 
scheme, where younger vertices are ranked higher. As can be expected, this scheme 
is not likely to lead to a heavy tail degree distribution: with high probability, the 
maximum degree is of order logn, where n is the total number of vertices. 

In [TJ, a ranking scheme based on an external prestige label for each vertex 
is given, and it is shown through a heuristic analysis and simulations that this 
scheme also leads to power law graphs, with the same exponent. Precisely, each 
vertex at its birth is assigned a randomly chosen fitness i G (0, 1), and vertices 
are ranked according to their prestige label. As argued below, since the ranking is 
based only on the relative values of the fitness values, the distribution according 
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to which i is chosen is irrelevant (all distributions give equivalent graph generation 
processes). 

In order to allow for a different distribution of "prestige" over the vertices, we 
considered also a random ranking scheme. Here, each vertex is assigned an initial 
rank according to a given distribution. We consider distributions of the following 
form. Let Ri be the initial rank of a vertex born at time i. Then P(i2j < k) = 
(k/i) s . First we show that, if s — 1, then the situation is similar to the one 
described previously, and vertices with initial rank Ri exhibit behaviour as if they 
had received fitness Ri/i. Thus, we obtain a power law graph. 

Next, we consider the case where s > 1, so the rank of new vertices is biased 
towards the lower ranks (note that low rank refers to a vertex with high value of 
Ri). In this case, with high probability the rank of a vertex will remain close to 
its original value throughout the process, so the behaviour is similar to the case 
of ranking according to age, and we obtain a power law graph. If s < 1, so initial 
ranks are biased towards the higher ranks, then we show that vertices tend to drift 
towards the lower ranks, and the behaviour is similar to that of ranking according 
to inverse age, where no power law is likely to occur. Thus, the value of s = 1 
gives a sharp threshold for power law behaviour of the degree distribution. 

These results suggest an explanation for the power law degree distribution often 
observed in real-life networks such as the web graph, protein interaction networks, 
and social networks. The growth of such networks can be seen as governed by a 
rank-based attachment scheme, based on a ranking scheme that can be derived 
from a number of different factors such as age, degree, or fitness. The exponent of 
the power law is independent of these factors, but is rather a consequence of the 
attachment strength. In addition, rank-based attachment accentuates the differ- 
ence between higher ranked vertices: the difference in link probability between the 
vertices ranked 1 and 2 is much larger than that between the vertices ranked 100 
and 101. This again corresponds to our intuition of what constitutes a credible 
mechanism for link attachment. 

In order to establish the right attachment strength to model a given real-life 
network we should consider the following. In a graph in which the number of 
vertices of degree k decreases roughly as k~^ the fraction of vertices of degree at 
least k changes roughly as 

^0(r 7 ) = 0{k 1 ~ 1 ) . 

e>k 

Thus, in order to imitate this distribution the attachment strength a should be 
set to a ~ 1/(7 — 1). For the web graph the distribution of total degrees is, at 
this moment, unknown. However, the number of vertices of in-degree k decreases 
roughly as k~ 2A , while the fraction of vertices of out-degree k can be approximated 
by k~ 2 - 7 (cf., Broder et al. |4J). Thus, if the total degree of the graph is close to 
the distribution of in-degree, then the appropriate attachment strength for a rank- 
based model is a ~ 0.91. 
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2. 



Definitions 



In this section, we formally define the graph generation model based on rank- 
based attachment. The model produces a sequence {Gt}^i = {(Vt,E t )}^ 1 of 
undirected graphs, where t denotes time. Our model has two fixed parameters: 
initial degree d G N, and attachment strength a6 (0,1). At each time t, each 
vertex v G V t has rank r(v, t) G [t] (we use [t] to denote the set {1, 2, ... , t}). To 
obtain a proper ranking, the rank function r(-,t) : Vt — > [t] is a bijection for all t, 
so every vertex has a unique rank. In agreement with the common use of the word 
"rank", high rank refers to a vertex v for which r(v, t) is small: the highest ranked 
vertex is ranked number one, so has rank equal to 1; the lowest ranked vertex in 
V n has rank n. The initialization and update of the ranking is done according to 
a ranking scheme. Various ranking schemes are considered in this paper; we first 
give the general model, and then list the ranking schemes. 

For any < a < 1, we define the function g a : N — > M: 



Let G± = (Vi, Ei) = ({vi}, 0) be a fixed initial graph with a single vertex with 
d loops, and rank r(vi,l) = 1. For t > 1 we form G t from G t -\ according to the 
following rules: 

• Add a new vertex v t together with d edges from v t to existing vertices 
chosen randomly with weighted probabilities. The edges are added in d 
substeps. In each substep, one edge is added, and the probability that Vi is 
chosen as its endpoint (the link probability), equals r(vi, t— l)~ a /g a (t — 1). 

• Update the ranking function r(-,t) : V t — > [t] according to the ranking 
scheme. 

Our model allows for loops and multiple edges; there seems no reason to exclude 
them. However, there will not in general be very many of these, so excluding them 
can be shown not to affect our conclusions in any significant way. 

We now define the different ranking schemes that are considered in this paper. 

• Ranking by age: The vertex added at time t obtains a rank t and retains 
this rank. That is, r(vi, t) — i for i G [t\. 

• Ranking by inverse age: The vertex added at time t obtains a rank 
1, but its rank shifts by one each time a new vertex is added. That is, 
r(vi,t) = t - i + 1 for % G [t\. 

• Ranking by random labeling: The vertex added at time t obtains 
a label l(v t ) G (0, 1) chosen uniformly at random. Vertices are ranked 
according to their labels: if l(vi) < l(vj), then r(vi,t) < r(vj,t). 

• Random ranking: The vertex added at time t obtains an initial rank R t 
which is randomly chosen from [t] according to a prescribed distribution. 
Formally, let F : [0, 1] — > [0, 1] be any cumulative distribution function. 
Then for all k G [t], 



t 



t 



l-Q 




(1) 



F(R t <k) = F(k/t). 
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• Ranking by degree: After each time step t, vertices are ranked according 
to their degrees in G t , and ties are broken by age. Precisely, if deg{vi, t) > 
deg(vj, t) then r(vi, t) < r(vj, t), and if deg(v «, t) = deg(v j, t) then r(t>j, t) < 
r(vj,t) if % < j. 

In the rest of the paper, {Gt}^ is assumed to be a graph sequence generated 
by the rank-based attachment model, with ranking scheme as defined in each 
particular section, and d and a are assumed to be the initial degree and attachment 
strength parameters of the model as defined above. The results are generally about 
the degree distribution in G n , where the asymptotics are based on n tending to 
infinity. 

We will use the stronger notion of wep in favour of the more commonly used 
aas, since it simplifies some of our proofs. We say that an event holds with extreme 
probability (wep), if it holds with probability at least 1 — exp(— ©(log 2 n)) as 
n — > oo. Thus, if we consider a polynomial number of events that each holds 
wep, then wep all events hold. To combine this notion with asymptotic notations 
such as 0() and o(), we follow the conventions in [T7] . 



3. Ranking by degree 

The first ranking scheme we consider is the "ranking" version of preferential 
attachment: vertices with higher degree are ranked higher. That is, the rank 
function r(-,t) : V t — > [t] is determined by the degree sequence at time t: if 
deg(fj,t) > deg(vj,t), then r(vi,t) < r{yj,t); otherwise (that is, if deg(vi,t) = 
deg(vj,t)) r(vi,t) < r(vj,t) if % < j. In contrast to all other ranking schemes, this 
means that the rank of a vertex can change by more than one in each step; if the 
degree of a vertex changes from k to k + 1, then the change in its rank can be as 
large as the total number of vertices of degree k and k + 1. 

In this section only, in order to omit tedious details, we assume that d — 1. The 
general case can be studied in a similar way. 

For all t > 1 and k > 1, let Yk(t) denote the number of vertices of degree k in 
G t , and let Y< k (t) =Y 1 (t) + --- + Y k (t). (Note that Y (t) = for all t > 1.) Note 
that, at time t, the vertices of degree k have ranks starting at t — Y<k(t) + 1, and 
ending at t — F<^_x(t). When the edge VjV t+ i is added at time t + 1, the change 
in the Y^s has contributions from two sources, namely, the change in degree of 
vertex Vj, and the addition of vertex v t +i of degree 1. Note that the probability 
that a vertex of degree k receives a link in step t + 1 equals 



Thus, the following equations express the expected change in each time step: 




t-y< fc _ x (t) 



E(Y 1 (t + l)-Y 1 {t) | G t ) 



1 - 



ga^-g^t-Y^t)) 

9a(t) 
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and similarly, for all k > 2, 

E(Y k (t + 1) - Y k (t) | G t ) = 9 a (t-Y< k . 2 (t))- g a (t-Y< k ^(t)) 

9a(t) 

Pa(t-y<fe-l(*)) ~ 9a(t-Y< k (t)) 
9a(t) 

(Note that Y k (t) = for all k > t.) 

To analyze the behaviour of the Y h we use the differential equations method 
(see for a general survey [IS]). First, by interpolating variables Y^(t) by real func- 
tions and presuming that the changes in the functions are equal to their expected 
changes, the equations above can be turned into differential equations. The na- 
ture of the limiting behaviour as n — > oo can be emphasised by considering real 
functions Zi(x) to model the behaviour of -Y^xn). Using the approximation 01) , 
we obtain a system of differential equations: 

l-a 



z[(x) 



X 



E 7 =lW)\ n L E 7 =lW) 



z' k (x) = 1 1 - 3 Jy ' 1 -2(1 



X \ X 



+ [l- T,kj=1 x Zj{X) ^j for k > 2, 

where Zq(x) = for all x. The initial conditions are z k (0) = for all k (or 
more precisely, the right limit as x approaches zero equals zero). The solutions 
of these equations are z k {x) = c k x, where the constants c k are defined below. 

Let C k = 1 — J2j=i c j = Yl^k+i ]! so °k = Cfc-i — C k . Solving the differential 
equations one by one, we get the following recurrence: 

Co = 1 

d = \-c\- a 

C fc _i-C fc = C x k _ a 2 - 2C\Z$ + C\- a for k > 2. (2) 
The recurrence ([2]) is telescoping, so it can be simplified. Indeed, 

c k (i + c k a ) = C-^ + M + c-) 
= clzi - c\- a + C 2 (l + C 2 a ) 

1 — Oi 

u fc-l • 

Thus, C = 1 and C k + C^~ a = C\zt for fe > 1. 

Lemma 3.1. Lei i/ie sequence [C k : k > 0) be recursively defined so that Co = 1, 
and C k is the unique positive solution to the equation 

Ck + CI* = C£f. (3) 
T/ien C k = c a k~ x l a (\ + o(l)), where c a = {^) 1/a ■ 
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Proof. Note first that the function f(x) = x + x 1_a is concave and strictly increas- 
ing when x > 0, so Ck is well-defined and decreasing. Let Bk = C^k 1 ^ . We will 
show that Bk converges to a constant when k — > oo. Substituting the expression 
for Bk into ([3]) we obtain the following recurrence relation: 

Consider the function / fc : [0, oo) — > R given by 



1-a 



A (,) = x'- + £-(i + lf l T j 

The roots of the function are x = and x = x^, where 

i 

1 



1-a 

Using the Taylor expansion of (l + r^-A 01 , and considering u, k , 
Xk is a decreasing sequence for k > 2, with limit (I=H) 1//a = Cqi 

It is straightforward to verify that f' k {xk) > 0. Since fk{%k) — and x 1_a +x/k 
is an increasing function of x, it follows that > x& implies that Bk < Bk-\, 
and Bk-i < Xk implies that Bk > Bk-i- Using the recursive expression for Bk, we 
obtain that 

l-q 

h(B k ) = Bi- + f - (i + -L_) a BjT 

1-a 

i + ^y) " (Biz? - B\r a ) (4) 

Since fk is increasing, x > x^ if and only if fk(x) > /fc(xfc). Thus, if £>fc_i > Xk 
then -Bfe_i > Bk, so fk(Bk) > and thus Bk > Xk > Xk+i > c a . Therefore, if 
there exists a value k > 2 so that Bk -i > Xk , then : > k } is monotone 
decreasing, and bounded from below by c a . If no such k exists, then B^-i < X& 
for all k > 2, and thus 5fc is monotonically increasing, and bounded from above 
by a converging sequence. In both cases, the sequence Bk converges. 
From (j3J), we can then conclude that fk(Bk) — > 0, and thus 5^ — > 
— > oo □ 

Since = C^_i — the above lemma implies that Ck = (l/a)c a k~( 1+1 / a \ We 
will see in the rest of this section that the variables Yj indeed follow the behaviour 
suggested by the discussion above, as expressed in the following theorem. 

Let 

n a/(4a+2) 

K = Kin) = — , jtz —tt — . 

\og 2a/{2a+l) n 

Theorem 3.2. Wep for all k,l < k < K and t, 1 < t < n 

Y k {t) = c k n(l + o(l)) = {l/a)c a k l/a+l n(l + o(l)). 
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Note that if the random variable Y k (t) is smaller than c k t at some point of the 
process, then the probability that Y k {t) increases in the next step will go up, so 
the value will be increasing with a higher rate. Likewise, if Y k > c k t, then the 
probability that Y k {t) stays equal goes up, so Y k (t) tends to stay at the same value 
for longer. Thus, these random variables have a certain "self-repairing" quality, 
so we expect them to behave well. 

This suggests that, in order to show a concentration of Y k , the differential equa- 
tions method can be used. In this case, the full force of this method need not be 
used, but it suffices to use martingales, or bound the variables by stochastically 
dominating the behaviour using binomially distributed variables — these can be 
considered as primitive versions of the differential equations method. We present 
a technique based on a well-known Azuma-Hoeffding inequality (see for example 
Lemma 4.2 in [16]). 

Lemma 3.3. Let Xq, X\, . . . , X t be a supermartingale such that \Xj — -Xj_i| < Cj, 
1 < j < t, for constants Cj, Then for any £ > 



To avoid tedious repetition, we present the full proof of Theorem I3.2l for variable 
Yi only. A proof sketch will be given for the other cases. 

Theorem 3.4. Wep for all t,l < t < n 



Proof. Let X(t) = Yi(t) — C\t. Since we expect Yi(t) to stay around Cit, and thus 
X(t) to be a random variable close to a martingale. For any two time instances 
U <T, let us define event A(U,T) as follows as the conjunction of the following: 



(ii) X(t) is nonnegative at time t, for all U < t < T, and 
(iii) X(T) > £ = v /nlogn. 

Now we estimate the probability that A(U,T) holds for some fixed U, T, 1 < 
U <T<n. Let T\ be the smallest t > U so that X(t) < or t = T. For all t so 
that U < t < Ti, 



So for U < t < Tx 

E(X(t + 1) - X(t) | X(t)) < (1 -ci)ci -ci(l-ci) = 0. 

Thus, X(U),X(U + l),...,X(Ti) is a supermartingale where < 1 and 

\X(t + 1) - X(t)\ < 1. So from Lemma O it follows that 




Yi(t) = at + O(Vrclogn) . 



(i) X{U) e [0, 1) 




1 — c\ with probability (1 — c\ — X(t)/t) 
—C\ otherwise. 



< (i-c 1 y- a = c 1 



p(a(Tx) > A < exp(-£ 2 /2n) = exp(-6(log 2 n)). 



Since condition (ii) is equivalent to T\ = T, we have 

¥(A(U,T)) < P(Ti = T)P(X(Tx) > £| T = T{), 
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and by the previous argument the last probability is exponentially small. 

Similarly, we can define events B(U,T), applying conditions (i), (it) and (in) 
to -X(t). (So condition (i) of B(U,T) is that X(U) G (-1,0], etc.) It can 
then be shown in an analogous way that ¥(B(U,T)) < exp(— 0(log 2 n)) for any 
U, T, 1 < U < T < n. Since all events have small probability wep none of them 
occur. Indeed, 

E (^2Ia(u,t) +J2 T B(UX) J = 0(n 2 )exp(-e(log 2 n)) = exp(-0(log 2 n)) 

V U,T U,T J 

and this fact follows from Markov's inequality. Given that none of the events 
occur, the assertion holds deterministically. □ 

We can repeat the same argument for all Y^s (2 < k < K). Since the error 
terms are cumulating, in order to get an asymptotic behaviour K has to satisfy 
the following equation 

K^/n\ogn = K~ l ~ l l a n/ \ogn = o(cKn) . 

Note that K(n) as defined earlier satisfies this equation. This completes the sketch 
of the proof of Theorem 13.21 

4. Deterministic Ranking Schemes 

In this section we consider two ranking schemes that are deterministic, that is, 
the rank of a vertex r(vi,t) does not depend on Gt, but is completely determined 
by i and t. In this case, the events that v i receives a link in time step t are 
independent for all t. Thus, deg(t>j,n) is the sum of n — i independent Bernouilli 
trials with pre-determined probabilities. The general theory about such sums can 
be directly applied to obtain the results in this section. 



4.1. Ranking by Age. Ranking by age means that older vertices have a lower 
rank. Precisely, the rank of a vertex equals the time it is born, that is, r(vi,t) = i 
for all 1 < i < t. As mentioned in the Introduction, rank-based attachment 
with ranking by age is a special case of the growing protean graph model defined 
in [15]. The growing protean graph model is more general since it permits deletion 
of vertices. Theorem 5.1 and Theorem 5.2 in [15] give results for the degree of 
a vertex that apply to our model. However, the next theorem gives stronger 
concentration results since it is adopted to the special case that no deletion occurs. 

Theorem 4.1. For ranking by age, the expected degree of a vertex Vi, i G [n], is 
given by 

Edeg(vi,n) = (1 + 0(n-^-^))d^^ ( (-Y + ^ • 

cx V V z / 1 (y. J 

Moreover, ifi< n/ \og s ^ a n, then wep 

deg(v u n) = (l + 0(\og- 1/2 n))d- 1 



a \i 
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Proof. Let X(t,j) be a random indicator variable for an event that vertex vt joins 
Vi at substep j of step t (t G [n], j 6 [d]). 



P(X(t,j) = l) = l-P(X(t,j)=0) 



i~ a /g a (t) for t > i 
otherwise. 



The number of neighbours v t of such that t > i is a random variable and can be 
expressed as a sum Ylt=i+i Yuj=i-X-{t,j) of independent random variables. Since 
the number of neighbours v t of Vi such that t < i is always d, 

degfa.n) = d + Yl 

i=i+l 3=1 

Thus, using (pQ), 

n i _a " 1 - a 

E deg(, i; n) = d + dY: ^3T) = rf + d ^ E t i-a + 0(1) • 
t=i+i yQV ' *=i+i v ' 

Assuming that i > n a ^ 3 , we get that 

n 

Edegfa.n) = d+ (l + 0{n a ( a - 1)/3 ))d{l-a)r a J^t"- 1 

t=i+l 

= d + (1 + 0(n a{a - 1)/3 ))d^—^r a (n a -i a + 0(i a - 1 )) 
A similar calculation can be done for i < n Q//3 , noting the fact that 



Edeg(^,n) = 0{n a ' 3 ) + dt~ a ^ — - 



a 



t=n«/3 V ' 

In order to finish the proof, we use the fact that a sum of independent random 
variables with large enough expected value is not too far from its mean (see, for 
example, Theorem 2.8 in [10]). From this it follows that, if e < 3/2, then 

f e 2 \ 

P (| deg(t>i, n) — E deg(t> «, n)| > eE deg(t>j, n)) < 2 exp I — — Edeg(t> «, n) J . (5) 

Note that E deg(wj, n) = fi(log 3 n) for i < n/ log 3//Qf n. If we let e = logn/ a/E deg(t>j, n) 
in we get that wep deg(i>j, n) = (l + 0(£))Edeg(i>j, n) and the assertion fol- 
lows. □ 

Observe that, for small i, the expected degree of a vertex vi is dominated by 
the factor d^f- (") . Consequently, the degrees are distributed according to the 
power law. More specifically, let = Zk(n,d,a) denote the number of vertices 
of degree k and Z>k = J2i>k The following theorem shows that the Z>k follow 
a power law with exponent 1/a. Since the Z> k represent the cumulative degree 
distribution, this implies that the degree distribution follows a power law with 
exponent 1 + 1/a. 
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Theorem 4.2. Let < a < 1 and d G N, k > log n. Then wep 

l/a 



Z it= (l +0 (log-'" n ))n(i^.|) 



Proof. This theorem is a simple consequence of Theorem 14.11 One can show that 
wep each vertex V{ such that 

l/a 



has fewer than k neighbours, and each vertex V{ for which 

l/a 



< (1 - log" 1 / 3 n)n(— 4) 
\ a k J 



has more than k neighbours. □ 



4.2. Ranking by inverse age. To contrast the other schemes, we considered 
a scheme where new vertices are ranked the highest. Precisely, r(vi,t) = t — i + 
1. Intuitively, this scheme breaks the effect of "cumulative advantage", since no 
vertex has high rank long enough to accumulate a high degree. The results from 
this section give evidence that, indeed, this scheme does not lead to a power law 
degree distribution. 
Note that 

EdegKn) = d + dJ2 [ , \, = (l + o(l))d(l-a)^- 

t=2 ' t=i 

= (l + o(l))d(l-a)logn 

and also it is not hard to see that Edeg(fj,n) > Edeg(t>j,n) for 1 < % < j < n. 
Thus Edeg(fj, n) < (1 + o(l))d(l — a) logn for all i G [n]. Again deg(fj, n) can be 
expressed as a sum of independent — 1 random variables but since the expected 
degree is so low we cannot hope for concentration; the Chernoff bound only tells 
us that wep the maximum degree of G n is Edeg(t>i,n) + O(logn). 

We also show that the number of vertices with expected degree at least k de- 
creases exponentially with k. This suggests that the degree distribution does not 
follow a power law. 

Theorem 4.3. Let < a < 1, d G N ; and i = i(n) G [n]. The expected degree of 
a vertex Vi satisfies the following inequalities 

Edeg(wj, n) > d+ (1 + o(l))d(l — a)ailog(n — i) 
Edeg(vi,n) < d+ (1 + o(l))d(l - a) log(n - i) . 

Proof. Define 

n—i—l j 



m = E 
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Then, using the approach as in Theorem 14.11 we obtain, 

Edeg(v u n) = d + dj] 7- ~jr 

ttlti 9a(t ~ 1) 

= d + (1 + o(l))d(l - a) £ A ni-a 

t=i+l ^ ' 

= d + (1 + o(l))d(l - a)/(i) , 
for any i G [n] . The assertion follows from the fact that 

rn-i i y-n-t / • _ i \ a i 

/(i) > / t rr— dr = / 1 + - — - ——dx 

~ Jo (x + l) a (x + i) l - a J V x + iy x + i 

> / 1 + a = / 1 dx 

Jo V x + l/x + z Jo \x + i x + 1 / 

= (1 — a) log(n/i) + a;log(n — i + 1) > alog(n — i) 



n-i-l l nn-i-l + 1 



and 

/(i) < 1 + / t — -t— rfx = 1 + / ( : 1 I -^-dx 

n — t \ 

logfn — i) < logfn — i) + 1 . 

n- iy 

Corollary 4.4. Let < ot < 1, d e N. 



: Edeg(fj, n) > k} > n — (1 + o(l)) exp 
: Edeg(f j, n) > A;} < n — (1 + o(l)) exp 

5. Random ranking 



k — d 



d{l — a) a 

k — d 
d(l — a) 



□ 



In the two ranking schemes discussed in this section, the initial rank r(v t , t) of 
a new vertex t> t is a random variable Rt G [£]. The new rank function is simply 
formed by inserting the new vertex into the existing ranking, so for all j G [t — 1], 
r(vj, t) = r(vj,t — 1) if r(vj, t — 1) < Rt, and r(vj, t) = r(vj,t — 1) + 1 otherwise. 
The difference in the two schemes lies in the way that R t is chosen: in the first 
scheme, the rank of each vertex is based on a fixed, but randomly chosen, label, 
while in the second scheme, R t is randomly drawn from [t]. 

5.1. Ranking by random labeling. In this scheme, each new vertex vt obtains 
a label l(vt) G (0, 1) chosen uniformly at random. (Note that the probability that 
two vertices receive the same label is zero.) Vertices are ranked by their labels: if 
l{vi) < l(vj), then r(vi,t) < r(vj,t). 

First we note that the process of choosing a label uar from (0, 1) does not imply 
loss of generality. Namely, suppose that the labels are chosen from R according 
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to any probability distribution with a strictly increasing cumulative distribution 
function F. Since F is an increasing function, labels F(l(vi)) lead to exactly 
the same ranking as labels Ifa). But V(F(l(vi)) < x) = V(l(vi) < F~ 1 (x)) = 
= x, so the values of labels F(l(vi)) are chosen from (0, 1) according 
to the uniform distribution. 

Since the arguments used here are largely similar to those used in Section H] 
(namely, applying the Chernoff bounds for a sum of independent random vari- 
ables), we omit technical details. 

Theorem 5.1. Let < a < 1, d 6 N, i = i{n) G [n\, and < l{vi) = l(vi)(n) < 1. 
Ifi-liyi) > log 3 n ; then the expected degree of a vertex Vi that obtained a label l(vi), 
is given by 

E deg( Vi , n) = d+(l + 0{log- {1 ' a)/2 n))d(l - a)l{vi)~ a log(n/i) , 
and wep 

deg(vi,n) =Edeg(vi,n) + 0(a/E deg(vi, n) logn) . 

Proof. Note that r(vi, t) is the sum of independent indicator variables of the events 
l(vj) < l(vi) for % < j < t. Using large deviation inequalities and the fact that 
i ■ l(vi) > log 3 n, we get that wep for all % < t < n, 

t-l(vi){l-\og~ l/2 n) < r(vi,t) <t ■ l(vi) (1 + log~ 1/2 n) . 

Thus, 

Ed e g („ n) = d+d ^ («-'W(i + y"))r 

t=i+i ' 



n 

d+ {I + Oilog-t 1 "^ 2 n))d{l - a)l{ Vi )- a ~ 



t 

t=i+l 

= d+(l + 0{\og- (1 ~ a)/2 n))d(l - a)l( Vi )~ a bg(n/i) . 

Since deg(vi,n) can be expressed as a sum of independent random variables, we 
can use the Chernoff bound to show the concentration result. □ 

Using the notation from Section H] we present the main result. 

Theorem 5.2. Let < a < 1 and d e N, log 3 n < k < n a/2 / log 3a n. Then wep 

z* = (i + «i>)»(^fr(i + i 

Proof. From Theorem 15.11 and the fact that k > log 3 n it follows that wep all 
vertices i>j such that i > ik = k x l a log 4 n and 

l{vd > (l + log-^)/ 3 n) ^(1 - a)log(n/i) y a 
has fewer than k — d = (1 + o(l))k neighbours, and each vertex Vi for which 
Ifa) < (l-\og- {1 - a)/3 n) 



d(l — a) log(n/i) x 1 



k 
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has more than k neighbours. 

Let Xf, X~ , i e [n], be a family of independent random variables such that 

f (1 + l0£- (1 - a)/3 n) ( ^-^^(n/i) ) 1/a f • > • 

P(X+ = 1) = l-PpT+ = 0) = ^ + ° g U) \ k ) " k 

I 1 otherwise, 

and 

P(Xr = 1) = l-P(Xr = 0) = l [ ° g n H fe 7 " fe 

I otherwise. 

Then, Z> k can be bounded from below by X~ = Yli=i -^i an d f rom above by 
A' + =Et,A?. Thus, 

EZ 2 , = 0(if/- log' ») + (1 + o(l)) ± ( - " ) J 08( " /i) ) V ° 

= 0(k 1/a log 4 n) + (1 + o(l)) (^^) n jf ' (log(l/^)) 1/a ^ , 
and putting u = log(l/x) we get 

EZ> k = 0(k 1/a \og 4 n) + (l + o(l)) ^ 1 ~ a ^ 1 nj\ l/a e- u du 
, ( d(l - a)\ 1/a ^ (I 

where r(x) denotes the (complete) gamma function. Since the gamma function is 
an extension of the factorial, and is increasing for x > 2, T(l/a + 1) is a constant 
which lies between [1 /ctj ! and 

Since k < n a l 2 / log 3a n, EZ> fc = Q(y/n log 3 n). Using large deviation inequali- 
ties one more time, we can show that whp Z> k = (1 + o(l))KZ> k . This finishes 
the proof of the theorem. □ 



5.2. Randomly chosen initial rank. Next, we consider the case where the 
rank of the new vertex i> t , R t — r(vt, t), is chosen at random from [t]. As described 
earlier, the ranks of existing vertices are adjusted accordingly. In contrast to the 
previous scheme, in this case it now matters according to which distribution R t 
is chosen. We make the assumption that all initial ranks are chosen according 
to a similar distribution. In particular, we fix a continuous bijective function 
F : [0, 1] — > [0, 1], and for all integers 1 < k < t, we let 

W(R t <k)=F 

Thus, F represents the limit, for t going to infinity, of the cumulative distribution 
functions of the variables Rt. To simplify the calculations while exploring a wide 
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array of possibilities for F, we assume F to be of the form 

F(x) = x s , where s > 0. 

A special case is the case s — 1, where the distribution of each R t is uniform. We 
will see that this case is the threshold for a power law degree distribution; if s < 1, 
then the probability that a new vertex receives a high rank (that is, a low value of 
Rt) is enhanced, and thus we get behaviour similar to that of age-based ranking, 



as seen in Subsection 14.11 including a power law degree distribution; if s > 1, then 
the probability that a new vertex receives a high rank is diminished, and we get 
behaviour similar to the inverse age ranking scheme described in Subsection 14.21 
To study the degree of a given vertex vi in G n under this ranking scheme we 
again use the differential equations method. Assume that vertex t>i obtained an 
initial rank R^. Then r(fj, t), t > i, is a random variable, which in time step t 
increases by one precisely when R t < r(vi,t — 1). Since the latter happens with 
probability F(r(vi,t — l)/t), we have that 

E(r(« 4) t) - r(v h t - 1) | G t ) = ^ t ~ 1 ^ ' . (6) 

Using a real function z(x) to model the behaviour of r(fj, xn)/n, the above equa- 
tion leads to the following differential equation for z: 

**> = e> 

with the initial condition z(i/n) = R^/n. 

If s = 1, the general solution is z(x) = Cx, C G R and the particular solution 
is z(x) = &x. This suggests that a random variable r(vi,t) should be close to a 
deterministic function &t. We will use martingales to show that this is indeed 
the case. 

Let Y t = r< f^ for all i < t < n. The sequence {Y t : i < t < n} is a martingale 
with respect to the random process {Gt}- Namely, 

E(K* I Y t ^) = t + 1 V ' = l t ' t J = Yt-,- 

In order to show a concentration for Y t , and thus for r(v t), we use a well-known 
Azuma-Hoeffding inequality (see for example Lemma 4.1 in [16]). 

Lemma 5.3. Let X ,Xi,...,X t be a martingale such that \Xj — Xj-\\ < Cj, 
1 < j < t, for constants Cj, Then for any a > 

¥(\X t - X \ >a)< 2exp (- 

Now, we are ready to state a concentration theorem. 

Theorem 5.4. Let i = i(n) E [n], R4 = Ri(n) such that 1 < Ri < i and R%/(i + 
1) > log 4 n. If the vertex Vi obtained an initial rank Ri, and l(vj) = Ri/(i + 1), 
then wep 

t-l{vi){\ -\og' 1/2 n) <r{ Vi ,t) <t-l{ Vi ){l + log' 1/2 n) 
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for all t,i < t < n. 

Proof. Let Y t be a random variable defined as before. Note that l{vj) =Yi. It has 
been shown that {Y t } is a martingale and it is also easy to see that \Y t — Yt-i\ < \. 
Since 

2 



(?) =jf ^ + 0(1) = (I) 



we can apply Lemma I3U1 with c t = ~ and a = to obtain that wep \Y t — Yi\ < a. 
So wep 

Y ' = y; ( 1 + (t)h ,w (' + G&) ) = iW(1 + ° (lof 1 n)) ' 

Moreover, one can use a concept of a stopping time (see Section 12.4 in [9]) to 
show that wep all values of Y t , i < t < n, lie within the bounds given by the 
equation above. This finishes the proof. □ 

Since the proofs of Theorems 15.51 and 15.61 are almost the same as the proofs of 
Theorems 15.11 and I5.2[ we omit them stating the results only. 

Theorem 5.5. Let < a < 1, d G N ; % — i(n) E [n], Ri = Ri(n) such that 1 < 
Ri < i- If the vertex Vi obtained an initial rank Ri such that R%/(i + 1) > log 4 n, 
and l(vi) = Ri/(i + 1), then the expected degree of Vi is 

E deg(vi, n) = d+(l + O(log" (1 ~ o)/2 n))d{\ - a)l(vi)- a log(n/z) . 

and wep 

deg(vi,n) =Edeg(vi,n) + 0(a/E deg(vi, n) \ogn) . 
Theorem 5.6. Let < a < 1 and d G N, log 3 n <k < n"' 3 / log 2 " n. Then wep 

^ = (i + (i))„^A_Jj r(- + i 

Note that Theorem 15.21 and 15.61 suggest that the random ranking scheme with 
uniform distribution behaves similarly to the random labeling scheme. However, 
the upper bound values k for which the power law holds is lower in this case. 
This can be explained by the fact that the eventual rank of a vertex is not always 
easy to predict in this scheme. For example, assume that in the random labeling 
scheme, the first vertex obtained a label of 1/2. Then its rank at time n is almost 
surely close to its expected value of n/2. In the current scheme, if a vertex has 
initial rank Ri/2i, then the expected rank at time n also equals n/2 but this rank 
is not concentrated. Namely, the rank behaves like the proportion of white balls 
in Polya's urn problem, and thus r(vi,n)/n converges to a random variable with 
uniform distribution on [0, 1]. 

Next, we consider the case where s > 1. The general solution of the differential 
equation (jTJ) is z(x) = (x 1 ^ 13 + C)~ 1 / ( - s ~ 1 ' ) . Thus, we expect r{v^t) to be approxi- 
mately equal to nz(t/n) = + c) -1 / 5-1 . Note that, if t gets large, this function 
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converges to a constant. As we will see, the definition below captures the value of 
this constant. 

B* = (R$-'-(i + l) 1 -')- 1/i '- 1) . (8) 

Theorem 5.7. For all i > n 1 ^ 2 log s+1 n, if vertex Vi has initial rank Ri so that 
n 1 / 2 \og s+l n < R{ < (1 — log 1 " 5 n)i, then wep 

-i 

r(v i ,t) = R*(^l+(J^y J (l + Oaog^n)). 

The proof uses the supermartingale method of Pittel et al. [15] . as described 
in [161 Corollary 4.1]. We need the following lemma. 

Lemma 5.8. Let Gq, G\, . . . , G n be a random process and X t a random variable 
determined by Gq, G\, . . . , Gt, < t < n. Suppose that for some real (3 and 
constants "ft, 

~E(X t — X t -i | Go, Gi, . . . , Gt-x) < (3 

and 

\X t - X t _ x - (3\ < 7t 
for 1 < t < n. Then for all a > 0, 

F(For some t with < t < n : X t - X > td + a) < exp ( =^ ^ . 

Proof of Theorem \5.7\ We transform r(v{, t) into something close to a martingale. 
Consider the following real- valued function 

H(x,y)=y 1 - s -(x + l) l - s (9) 

Let w t = (t, r(vi,t)), and consider the sequence of random variables (i?(w t ) : i < 
t < n). Note that H(i,Ri) = We will show that wep H(t,r(vi,t)) is 

close to H(i, Ri). The function H is chosen so that if(w) is constant along every 
trajectory w of the differential equation ([7]). 
Note that 

grad H(w t ) = (-(1 -s)(t+ l)" s , (1 - s)r{v h t)~ s ) , 

It follows from the choice of H, and can be checked using §6§, that 

E(w t+1 - w t | G t ) ■ grad H(w t ) = 0, 

Using the fact that Ri < r(vi,t) < t at all times, we can show that all second- 
order partial derivatives of H evaluated at w t are 0(R; {s+1) ). Therefore, 

H(w t+1 ) - H(w t ) = (w t+1 - w t ) • grad H(w t ) + 0(R7 {s+1) ). (10) 
Taking the expectation of (TTDT) conditional on Gt, we obtain that 

E(H(w t+1 ) - H(w t ) | G t ) = 0(R; (s+1) ). 
The rank changes by at most one in each step, so from the above, we obtain 
\H(w t+1 )-H(w t )\ < (5-l)(r( % ,t)- s + (t+ir s )+0(i2- (s+1) ) 
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Now we may apply Lemma 15.81 to the sequence (H(w t ) : % < t < n), and 
symmetrically to (—H(w t ) : i < t < n), with a = R~ s n 1 ^ 2 \ogn, (3 = 0{R i <s+1 ' ) ), 
and 74 = 0(R^ S ). From the lower bound on Ri it follows that n[3 = 0(a), and we 
obtain that wep 

\H{w t ) -H{wi)\ = 0{Rr s n 1 ' 2 log n) 
for i < t < n. As H(wi) = (i?*) 1_s , this implies from the definition (Q of the 
function H, that wep 

r{ Vi) t) l - s = (R*) 1 - 3 + (t + l) 1 - s + 0(Rr s n 1/2 logn) 

= ((R*) 1 -* + (t+ I) 1 '*) (1 + O^R^R^n 1 ' 2 log n) 

for i < t < n, so 

-i 

r(v i ,t) = R* (l+ (J^y 1 (l + 0((R:y- l Rrn^ 2 \ogn)). 

Since < (1 — log 1_,s n), we have that 

R*/R i = 0(\ogn). (11) 

Since i?j > n 1 / 2 log s+1 n we have that 

{R^R-'n 112 \ogn = (i^V /2 log 5 n) = ©(log" 1 n) 

which finishes the proof of the theorem. □ 
We can now use the same approach as for age-based ranking. 

Theorem 5.9. For a vertex Vi so that n 1 ^ 2 log s+1 n < R4 < (1 — log 1 ^ s n)i and 
R* < n log~ 3 / Q n, wep, 

deg(^n) = {l + OiXog-^^-^nyd-^- (± 

1 - a \R { 

Moreover, for all vertices v i} wep 

deg(vi,n) = d- — - ( ] +0(log 2 n). 



a \Ri, 

Proof. The proof follows the same reasoning as the proof of Theorem 14. 11 To prove 
the first part, using Theorem 15. 7\ we obtain the expected degree of Vi at time n 
as follows: 

Edeg(«i,n) = Edeg( Vi ,2R*) + d ^ '''''' 



1 -■:>!-; -i-i 9a(t) 



For the first term, we use the fact that r(v j, t) > Ri for all t > i. 

9a(t) 

d + (1 + o{l))dRr a (l -a) ^2 t a ~ l 



2R* 

Edeg^i?*) < d + d J2 %S (12) 

t=i+l " 



2R* 



t=i+l 

om/Rin 
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Since (R*/Ri) = O(logn) (see ( JTlTl ) and, by assumption, n/R* > log 3// °n, we have 
that 

Edeg{vi,2R*) = 0{\og a n) = 0({n/R*) a log" 2 n). 
Now, we can estimate the second part as follows: 

r{v h t)- a 



d £ 



/:-:_'/,':+.! 9a{t) 



Jl + OOog- 1 n))d(l - a)(R-)-" JJ 



i=2R*+l 
n 

(1 + OOog" 1 n))d(l - a)(R*)- a £ + ©((iZJ)- 1 ^-)) 

t=2R*+l 

(1 + OOog- 1 n))d(l - a)(R*y a (n a /a - 0((2#*) Q ) + 0((i?*r V^ 1 )) 
(l + Oaog-^^-^n))^ 1- " / " " 



1 — a ( n 



a \R* 

since R* < n log~ 3//a n. Therefore, 

Edeg^n) = (1 + 0(log- min{1 ' 3(s - 1)/a} n))rf 

Using the Chernoff bound as before (see ([51)), together with the fact that 
Edeg(ui) = f2(log 3 n) for R* < n log~ 3//a n, and putting e = log n/ ^/E deg(fj) 
in ([5]), we get that wep deg(vj) = (l + 0(£))Edeg(t>j) and the assertion follows. 

To prove the second part, we can use a calculation similar to ([121) to show that 

Edeg(fj,n) = d (^^) (jt^J + ^(1)' anc ^ use the Chernoff bound to prove the 
statement of the theorem. □ 

Theorem 5.10. Let k be so that log 4 n < k < n a ^ 2 \og~ a ^ s+3 ^ n. For random 
ranking with initial rank distribution given by F where F(x) = x s and s > 1, 

Z> _ k =(l + 0{lo£-°n))n(^. d ^ y \ 

Proof. Let w(n) = n 1 ' 2 log s+1 n. Fix k so that i og max (3«,3/a+2) < fc < nQ / 2 log -«(*+3) n> 
Define sets and as follows: 



= { Bj |K<(i-io g - n ) n (i^.^) 

S+ = L | **>(! + log 1 - „)„(i^.0 



1/a' 



1/a' 



We will argue below that wep all but a small fraction of the vertices in S k have 
degree at least k, and in S£ have degree less than k. First, we estimate the size 



of S k and S£. 
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Let f(k) be a function so that f(k) = 0(fc _1//Q ra); f(k) is meant to represent 
the bound on R* that defines S£ or S k . The bounds on k imply that f{k) = 
0(n log -3 n) and f(k) = VL(uo(n)\og 2 (n)). 

From ©, < /(fc) if and only if Ri < ((i + l) 1 " 5 + /(A;) 1 - 5 )- 1 /^" 1 ). Thus 

— s 

for any i, the probability that R* < f(k) equals ^1 + ({Jf)* J (1 + \)- The 
expected number of vertices Vi so that -R* < f{k) is expressed by the following 



sum: 



rn//(*) 

= /(fc) / (l+y 5 - 1 ) — dy + 0(logn) 
= /(*)(l + 0(n- 1 ' a )). 

i — s 

The last step can be explained as follows. The antiderivative of (1 + w J 8 - 1 
equals (1 + y 1 ^ 11 ) 1 ^ , and thus J °°(l + y a ~ v )'* =l dy = 1. The integral from 
to l/f(k) is at most l/f(k) = 0(rT l l 2 ). The integral from n/f{k) to infinity 

equals ^-j- {^^j (1 + o(l)) = o(l). The result then follows because /(&) = 
^(n 1 / 2 logn). 

Using the Chernoff bound, and the lower bound on f(k), it follows that wep, 
the number of vertices with R* < f(k) equals (l + 0(log~ 1 n)f(k). Therefore, wep 



K |=(l-V-»)n(^-3 



1/a 



while the number of vertices that is neither in S£ nor in S k is ©(log 1 s \S k |). 

Consider the vertices in S k . Let f(k) = (l — log 1_s njn . i^ 1 Q . From the 
bounds on k it follows that R* < f(k) = 0(n\og~ 3 n), so we may assume that 
R* < n log -3 /" n. By Theorem 15. 9[ if u{n) < Ri < (1 — log 1_s n)i, then wep 
deg(fj, n) > k. 

Consider the vertices in S k with initial rank Ri < u(n). Since lower initial rank 
will wep lead to higher degree, and since deg(t>j,n) would have been at least k 
even if the initial rank Ri had been 2u)(n), we can conclude that these vertices 
also have degree at least k. 

If Vi E S k and Ri > (1 — log 1 ^ s n)i, then this implies that % < i^, where is so 
that 

(1- log 1 " 8 n)i k = ((i i + l) 1 - + f(k) 1 -)- 1/l '- 1) 

X _ s \ -l/(»-l) 



(i k + l). 



It is straightforward to verify that ik = Q(f(k) log 1 n). Since | = 0(/(fc)), 
the number of vertices in 5r that c?o not have degree at least k is OdS^I log -1 n). 



RANK-BASED ATTACHMENT LEADS TO POWER LAW GRAPHS 



21 



Next, consider the vertices in S£- From the bounds on k, it follows that R* = 
Q(uj(n) log 2 (n)). If R4 < (1 — log 1- s n)i, then R*/Ri = O(logn), and thus we may 
assume that Ri > u>(n). 

If R* < nlog~ 3/a n and uj(n) < Ri < (1 — log 1 s n)i then, by Theorem 15.91 wep 
deg(fj,n) < k. If R* > n\og~ 3 ^ a n, then, by the second part of Theorem 15. 9[ wep 
deg(vi,n) = 0(\og 3+a n) < k. 

This time, let f(k) = (l + log 1 " 5 n)n (^f . j.) 1 ^ . if r. > (1 _ fog 1 " 5 ^ 
and % > 3f(k) then we may assume that Ri > 2f(k), and, using the second 
part of Theorem 15. 9[ we find that wep deg(uj,n) < k. The probability that 
Ri > (1 — log 1_s n)i equals 1 — (1 — log 1_s n) s < slog ~ s n. So the expected 
number of vertices i>, with i < 3f(k) and R4 > (1 — log 1-5 n)i is 0(log 1_s nf(k)), 
and, using the Chernoff bound again, we can conclude that wep the actual number 
is at most of the same order. Since f(k) and \S^\ are both 6(fc~ 1/ ' a n), we can 
conclude that the total number of vertices in S£ that do not have degree less than 
k is 0(|<S^| log ~ s n). This completes the proof of the theorem. □ 

If s < 1, then the solution of the differential equation (J7J) is the same as for the 
case that s < 1. Using methods almost identical to the ones used for the case 
where s > 1, we can show that wep the rank is close to the one suggested by the 
differential equation. 

Theorem 5.11. For all i > n 1 ^ 2 log s+1 n, if vertex V\ has initial rank Ri so that 
n 1 / 2 log s+1 n < Ri, then wep 

r(v h t) = ft - ^Y~~ ^) (l + O^n^-^logn)). 

Proof. The initial part of the proof is identical to the proof of Theorem 15.71 and 
is thus omitted. Using the differential equation method, we can show that 



r^t) 1 -* = {R*) 1 - 3 + {t+l) 1 - s + 0{R- s n 1 / 2 logn) 

= ((it!*) 1 " 5 + (t+ l) 1 -*) (1 + 0{t s - l Rr s n^ 2 logn)) 

= {(R*) 1 -* + (t+ l) 1 " 5 ) (1 + 0(t s - V 1 -^ 2 logn)) 

for i + 1 < t < n, so 



1 

r{v h t) = ^f(f) 1 8 + ("T") 1 'J "(l + OC^n^logn)) 
= (t+ (l+Oft'-^-^logn)). 



□ 



Thus, the rank of a vertex at time t tends to be close to t, which means we are 
in a situation similar to the inverse age case. In emulation of Theorem 14. 3[ we can 
show that almost all vertices have expected degree at most O(logn). Since the 
proof is almost identical to the proof of Theorem 14.31 it is omitted. 
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Theorem 5.12. Let < a < I, d e N, and i > n 1/2 log 2(1_s) n. Then 

Edeg(vi,n) = O(logra). 
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